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Abstract 

In the field of compressed sensing, a key problem remains open: to explicitly construct matrices with the restricted 
isometry property (RIP) whose performance rivals those generated using random matrix theory. In short, RIP involves 
estimating the singular values of a combinatorially large number of submatrices, seemingly requiring an enormous 
amount of computation in even low-dimensional examples. In this paper, we consider a similar problem involving 
submatrix singular value estimation, namely the problem of explicitly constructing numerically erasure robust frames 
(NERFs). Such frames are the latest invention in a long line of research concerning the design of linear encoders that 
are robust against data loss. We begin by focusing on a subtle difference between the definition of a NERF and that 
of an RIP matrix, one that allows us to introduce a new computational trick for quickly estimating NERF bounds. 
In short, we estimate these bounds by evaluating the frame analysis operator at every point of an e-net for the unit 
sphere. We then borrow ideas from the theory of group frames to construct explicit frames and e-nets with such high 
degrees of symmetry that the requisite number of operator evaluations is greatly reduced. We conclude with numerical 
results, using these new ideas to quickly produce decent estimates of NERF bounds which would otherwise take an 
eternity. Though the more important RIP problem remains open, this work nevertheless demonstrates the feasibility 
of exploiting symmetry to greatly reduce the computational burden of similar combinatorial linear algebra problems. 
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1. Introduction 

Throughout this work, let O denote a short, real MxN matrix; though some of the results presented here generalize 
to the complex setting, others do not. That is, let M < N and let <E> = [ip\ ■ ■ ■ tp^] where {<p„}^ =l £ K M are the columns 
of <J>. For any 'K c {1, . . . ,N}, define K := \K\ and consider the M x K submatrix <J>>/c of <t> obtained by concatenating 
the columns {if„}„eK- Fixing K, both the restricted isometry property and numerically erasure-robust frames are 
defined in terms of the extreme singular values of $>>k as 7C ranges over all Zf-element subsets of {1, . . .,N}; the key 
difference between the two depends on whether the O^'s are tall (K < M) or short (M < K < N). 

In particular, for a fixed K < M and 6 > 0, the matrix <t> is said to have the (K, 5)- restricted isometry property 
(RIP) if for any /f-element subset K c { 1 , . . . , AO, we have 

a-s)J]\y(n)\ 2 < YjyWvn 2 <a + s)2W")i 2 ' vy^ RN - (D 



ne'K 



ne'K 



This means that <t> acts like a near isometry over all vectors whose support is at most K. More precisely, ([D is 
equivalent to having (1 — <5) < HO^zll 2 < (1 +6) for all unit norm z e R K , which in turn is equivalent to having all 
of the eigenvalues of the Gram submatrix O^cD^. lie in the interval [1-6,1+6]. This can be viewed as a weakened 
version of a requirement that Q^Q^ = I. That is, RIP requires every /^-element subcollection {(p n } ne <K of {ip„}^ =l to 
be nearly orthonormal, forming a nice Riesz basis for its span. 
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RIP matrices are important because they permit efficient sensing and reconstruction of sparse signals — see The- 
orem 1.3 in [7], for example — a problem which can otherwise be NP-hard lfl3ll . In particular, for a fixed M, N and 
6 > 0, the goal in such applications is to construct M x N matrices cD which are (K, 6)-RlP where K is as large as 
possible: larger values of K correspond to a larger class of A^-dimensional signals that can be fully sensed using only 
M measurements. Unfortunately, it has proven notoriously difficult to deterministically construct RIP matrices for 
large values of K. To be precise, whereas many randomly constructed matrices J3, Q have a high probability of 
being (K, <S)-RIP for any K < CM /polylog(A0, all known deterministic methods are either stuck at the "square-root 
bottleneck" of having K < C Vm — see for example — or go only slightly beyond it [5]. As detailed in [3], these 
difficulties stem from the fact that checking that a given MxN matrix O is (K, 6)-RlP involves estimating the singular 
values of (^j submatrices of <E> of size M x K. Since singular values are hard to estimate analytically and since QIj is 
enormous for even modest choices of N and K, this problem poses serious challenges for any analytical or numerical 
approach. 

In this paper, we provide a hybrid analytical/numerical approach for solving a problem that is very similar to 
that of deterministically constructing RIP matrices. Though, as we discuss below, our techniques do not immediately 
generalize to the RIP setting, they nevertheless demonstrate the feasibility of numerically estimating the singular 
values of a combinatorially large number of matrices. To be precise, the purpose of this paper is to provide new 
deterministic constructions of numerically erasure- robust frames (NERFs): given M < K < N and < a < ft < oo, 
we say that {<p n }% =J is a (^,a,j6)-NERFfor K M if for any ^-element subset "K c {1, . . . , N}, we have 

a||x|| 2 < J] |<jc, ip„)\ 2 < ft\\x\\\ Vx e R M . (2) 

ne'K 

This means that {(p n } n eK is a frame for R M with frame bounds a, ft regardless of the choice of 7C. Note (0 is equivalent 
to having a < ||<1>1jc|| 2 < ft for all unit norm x e M. M , which in turn is equivalent to having all the eigenvalues of the 
subframe operator lie in the interval [a, ft]. 

Thus, when a ~ ft, we see that a NERF is a short matrix for which every short submatrix of a given size is 
well-conditioned; meanwhile an RIP matrix is a short matrix for which every tall submatrix of a given size is well- 
conditioned. This subtle difference in definition leads to vastly different envisioned applications; while RIP matrices 
are primarily intended for compressed sensing, NERFs are the latest invention lioll in a long line of research fill 0, 
12, 14, 2] concerning the design of linear encoders that are robust against data loss. Here, one encodes x e K M as a 



higher-dimensional vector cD*x € M. N which is then transmitted in a channel with erasures and additive noise, yielding 
y = <t>^.x + e, where K corresponds to the entries of y that were not erased. The problem is then to reconstruct 
x from y. In the standard least squares approach, this reconstruction is achieved by solving the normal equations 
®k®'K x ~ ^tc^- T ne numer i ca l stability of this method depends heavily on the condition number of •t^-^^c which, in 
accordance with (f2j), has an upper bound of -. As such, when designing NERFs, our goal is to make K > M as small 

a 8 

as possible — meaning we are robust against more erasures — while keeping ^ from becoming too large; this contrasts 
with RIP where, as mentioned above, the goal is to make K < M as large as possible while keeping 5 small. 

The first study of erasure -robust frames was given in [11]. In subsequent years, it was shown that unit norm tight 
frames ar e op timally robust against one erasure ||6[] while Grassmannian frames IU7I1 are optimally robust against two 
erasures 01211 : this fact is in part responsible for the continued interest in equiangular tight frames B 1 71 1 1 9[ 1911 . In a 
purely algebraic sense, the frames which are most robust against erasures are those that have full spark 1141 1211. namely 
those collections of vectors \(p„}^ =l in M. M that have the property that any M of them form a basis for R M . However, 
in the presence of additive noise, what truly matters is conditioning of the subcollection, not its rank. This realization 
led to the introduction of NERFs in I1 1(111 . There, in attempting to construct NERFs, the authors ran into many of the 
same difficulties encountered when attempting to construct RIP matrices. This is due to the fact that both the RIP ([TJ 
and NERF conditions are seemingly combinatorial, requiring one to estimate the eigenvalues of {^j submatrices of 
O. Indeed, to date, the best known low -redundancy NERFs are obtained using random matrix theory 1 1 Oil - essentially 
because when coupled with a union bound, such constructions allow one to consider each M x K submatrix of O 
independently. Such constructions are becoming increasingly important as NERFs are proving useful outside of the 
original communications-based applications that inspired them; they can, for example, be used to reconstruct a signal 
using only the magnitudes of its frame coefficients yl. 

In the next section, we exploit the subtle differences in definition between NERFs and RIP matrices to introduce 
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a new trick which will enable us to estimate NERF bounds much faster than we can estimate restricted isometry con- 
stants. The key idea is to evaluate the entire frame analysis operator over a large grid of points on the sphere — an 
e-net — and, at each point, sort the resulting inner products according to magnitude. This trick reduces the number 
of computations from being exponential in N to being exponential in M. In Section 3, we then show how to exploit 
ideas from group frames to further reduce the computational complexity from being exponential in M to being subex- 
ponential in M. There, the method involves using a large group of orthogonal matrices — the symmetry group of the 
cube, for example — to simultaneously generate both the NERF and the e-net used to estimate its bound. Essentially, 
this trick allows us to work with an e-net that only covers a very small portion of the unit sphere, namely the set of 
all unit norm vectors with nonnegative entries sorted in nondecreasing order. In the fourth section, we then show how 
to explicitly construct such an e-net, and discuss how its size compares with another e-net generated by a popular 
method. Finally, in Section 5, we conclude with numerical experimentation, using our theory to construct explicit 
examples of NERFs and concretely estimate their bounds. 



2. Estimating NERF bounds with e-nets 

To motivate the results of this section, we begin with a few observations on NERF bounds. Given M < K < N, 

recall that if {ip}^ =l is a (K, a,y6)-NERF for R M then © holds for all ^-element subsets *7C C {1 N}. To better 

understand the problem of constructing NERFs, let's find an explicit expression for the optimal lower and upper 
NERF bounds, namely the largest value of a and smallest value of /3 for which © holds for all 7C. To do this, we first 
note that for any fixed subset 7C, the largest value of a and smallest value of /3 for which (f2]i holds are 

a% ■= min Y \{x, ip„)\ 2 , fi% := max V \{x, (p„)\ 2 , (3) 

namely the smallest and largest eigenvalues of (D^O^, respectively. As such, the largest value of a and smallest value 
of (3 for which (|2]i simultaneously holds for all Zf-element subsets 7C are 

av := min av = min min > \{x,ip„)\ 2 , Br ■= max /J*- = max max > |(x, w„)\ 2 , (4) 

\<K\=K \<K\=K\\x\\=\ ^ r r |7C|=r fK\=K\M=l*-j* 

respectively. That is, computing the optimal NERF bounds (|4]i involves finding the extreme eigenvalues of each of the 
{^j subframe operators of the form <$>* % . For even modestly-sized choices of N and K, this leads to an enormous 
amount of computation; as noted in the introduction, this parallels the computational difficulties that have, for several 
years now, stymied serious progress on the deterministic RIP construction problem. 

We now make three key observations which, taken together, give a different perspective on the NERF bound 
estimation problem. First, note that we can interchange the order of optimization in ©: 

a K = min min V \{x, ip„)\ 2 , [5 K = max max V \{x, (p„)\ 2 . (5) 
\\x\\=l\<K\=K*-£ M=im=K /-j 

neft nzK 

That is, whereas (|4|i asks us to "take any of the subsets of {1, . . . , N] and then find the unit norm vectors x e IR M 
which are most orthogonal/most parallel to {^„}„ e 7c," the alternative, but equivalent formulation (0 asks us to "take 
any unit norm x e R M , and then find the K vectors in {^ n )^ =1 which are most orthogonal/most parallel to it." The 
second key observation is that for any fixed unit norm x, the subsets K c {1, . . ., N] for which Yun^nc K x >^«)l 2 is 
minimized and maximized are quickly and easily obtained by sorting the values <p„)\ 2 }^ =l and then summing the 
K smallest values and largest values, respectively. That is, 

K N 

a K = min V |<*, </V(n)>| 2 , Pk = max V \(x, ipo-(„))\ 2 , (6) 
11x11=1 , iwi =1 ,^1^, 

n~\ n=N-K+l 



where cr is an x-dependent permutation of { 1 , . . . , AO chosen so that the values {|(x, ^o-(n))l 2 )^ = i are arranged in nonde- 
creasing order. The third and final key observation is that in the envisioned applications of NERFs, we do not need 
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to know ok and fa exactly; it suffices to have good estimates of them. In particular, it should not be necessary to 
evaluate 

K N 

(Xk{x) := 2^ K*» <P<r(.n))\ 2 , Pk(x):= \( x ><P<r(.n))\ 2 , (7) 

n=l n=N-K+\ 

at each of the infinitely many points x on the unit sphere in M. M . Rather, for the purposes of estimating and /3k, it 
should suffice to evaluate (Q over a large, finite grid of points on the sphere, namely over an e-net. 

To be precise, a set of points {i/f p }p =i is said to be an s-net for a set Q. equipped with a metric d if for every x e D, 
there exists iff p such that d(x, i// p ) < s. It turns out that it is most convenient for us to work with an e-net for the 
projective sphere in M. M under the chordal distance d(x\,X2) := -yl - \{x\, X2)\ 2 . In particular, given e > and letting 
S M ~ l denote the unit sphere in R M , we want 

GM£=i £ sX - Vx 6 §M_1 ' 3 P S - L K*. M 2 > 1 - e 2 - (8) 
Given such a net we, being inspired by ©, define s- approximate lower and upper NERF bounds by 

K N 

a K , B : = min V \(if/ p , (p IT (n))\ 2 , fa, e ■= max V \{i/r p , ipoin))\ 2 , (9) 
p=±,...,p * — i p~i,...,p — i 

n=l n=N-K+l 

where cr is a p-dependent permutation of {1, . . .,N] chosen so that the values {\(ifr p , VV(«))l 2 C=i are arranged in nonde- 
creasing order. That is, given an e-net ©, we form the estimates (0 by the following algorithm: for each p = 1 , . . . , P, 
compute Q>*i// p = {(if/p, <p n )} N =v then sort {\(ifr p , (p„)\ 2 }„ =l in nondecreasing order, and let cxk,s and /3k, e be the sum of 
the first K and last K resulting values, respectively. We now make this analysis rigorous, bounding the optimal lower 
and upper NERF bounds in terms of the estimates <j9j ; note that as e gets small these bounds become exact. 

Theorem 1. Let M < K < N, let {(p„}^ =i £ R M , and let {^ P } F p=l be any s-net ©/or § M ~'. Then, the optimal upper 
and lower NERF bounds ©/or {(fn}^ are estimated by the s-approximate bounds (0 according to 

j^(a K , e - j^fa,s) <a K < a K , s , fa, e < fa < T^fa, s - (10) 

Proof. For any /^-element subset 7C c {l,...,N}, let {A m }^ =l and {m,„}^ =1 be the eigenvalues and corresponding 
orthonormal eigenbasis of the MxM frame operator of the subcollection {<p„} n ex, yielding the spectral decomposition 

M 

ne<K m=l 

where x* denotes the 1 x M adjoint (transpose) of some M x 1 vector x. Now, given any x e R M , conjugating this 
expression by x gives 

M M 

2^ \(X, tf„}\ 2 = X* Yj fnfnX = X*$> % ®kX = X* ^ A m U m U* m X = 2^ A m \(x, U m )\ 2 . (11) 

neK n<SK m=l m=l 

Note that since {m,„}^ =1 is an orthonormal basis, then for any unit norm x, the values {\(x, u m )\ 2 }^ 1=l sum to one, meaning 
that (fTTT i corresponds to a weighted average of the eigenvalues {Am}^ =v Taking these eigenvalues in nondecreasing 
order, note that (01 gives A\ = a<K and Am = fa, meaning these weighted averages lie in the interval [a^,fa]- Since 
the e-approximate bounds Q are examples of such averages for certain choices of x and < K, we immediately obtain 
two of the four inequalities in (TlOb : 

a K = min a % < a K , E < fa iE < max fa = fa- 

\K\=K \K\=K 

Next, to obtain the upper bound on /3k given in (TTOt we, for any given K, consider (fTTb in the case where x is chosen 
to be the member of the e-net {i/f p } p=l which is guaranteed to be close to the eigenvector um corresponding to the 
largest eigenvalue Am = fa- That is, picking p such that \(i/f p , um)\ 2 > 1 - e 2 and letting x — if/p in (fTTb gives 

M 

Yj Wp'Vn)? =Yj Am Mp' Um) \ 2 ~ a m\(^ p ,u m )\ 2 = fa(l-s 2 ). (12) 

neK m=\ 

4 



At the same time, the left hand side of ( fl2b is but one of the ways to sum K choices of the values [\(tff p , y?«)| 2 }^ =1 
and is, of course, no larger than the sum of the K largest values: 

Z\(ifr p , (p„}\ 2 < max V \\(ifr p , ip„)\ 2 < max max V ' \{i/fp>,<p„)\ 2 = B K , B . (13) 
m=K p>=l,...,P\<K'\=K 

Combining (fT2T > and dl3ll . we see that B%(\ - e 2 ) < Bk, s for all K and so we obtain the upper bound on Bk from (fTOb : 

B K = max fa < jKpK.r, (14) 

We now use a similar approach to find the lower bound on arx; for any given 7C, taking x = if/ p in (TTTI) gives 

M M 

J] \<^ P ,fn)\ 2 = ^A m \{ijj p ,u m )\ 2 < Ai\(i// p ,ui)\ 2 + A M Y^\(^ P ^m)\ 2 = a K \(>/fp,ui)\ 2 + B K (l - \{^ p ,ui)\ 2 ). 

ne'K" m=\ m=2 

Simplifying, and picking p such that ifr p is the e-net member that satisfies \{iff p , «i)| 2 > 1 - s 2 then yields 

2 Up, fn)\ 2 =Pk-(Pk- vkWp, «i>| 2 <Pk-{Pk- <x<k){\ - e 2 ) = (1 - s 2 )a K + e 2 (3. k . (15) 

ne-K 

At the same time, the left hand side of ( TTBl l is no smaller than the sum of the K smallest values of [\(i/f p , i/j„)| 2 }^ =1 , 
meaning 

y \(tffp,<Pn)\ 2 ^ min y \{iff p ,<p n )\ 2 > min min V |<^,^„)[ 2 = or^ e . (16) 

'—- « A' -A - *-J p'-l,...,F A' -AT z — ^ 

Combining (TT3T > and ( TToT l and then using dT4b gives < (1 — s 2 )a<K + s 2 B<k < (1 - e 2 )a-x + y^Pk.e- At this point, 
solving for and then minimizing over all 7C gives the lower bound on from dTOb : 

jz^((XK, e - -^Pk, e ) < mma K = a K . □ 

We now give some remarks about the previous result and its proof. Note that the lower bound on aK in ([TUb is 
more complicated than the upper bound on Bk, as it involves both the approximate lower and upper frame bounds (Xk,e 
and Pk.f, from (|5). For those familiar with frame theory, this is to be expected: lower frame bounds are usually more 
difficult to estimate than upper frame bounds. At the same time, this fact is troubling, since the lower estimate is much 
more important than the upper estimate. Indeed, for small K, there is a real danger that the lower NERF bound could 
approach zero, see Theorem 1 1 of fioll . for example. Meanwhile, it is impossible for the upper NERF bound to grow 
too large: if the entire original frame {<p n } =i has an upper frame bound of B, then the upper NERF bound is at most B 
since 

N 

B K = max max y \(x,w„)\ 2 < max y \(x, tp n )\ 2 < maxB||x|| 2 = B. (17) 
[MI=l|9C|=z-^ \M=i^-f Nl=i 

ne'K n=l 

Moreover, though the estimates <j9j become exact as e tends to zero, choosing a small e forces the number of elements 
P in the e-net ([8]) to be large, making it prohibitively expensive to compute the approximate NERF bounds (0. In 
practice, we are therefore compelled to not take e to be too small; see the final section for a longer discussion on 
practical aspects regarding the size of e. We are therefore faced with a quandary: to ease our computational burden, 
we want to take an e which is not too small, yet doing so leads to coarse estimates (|5). Fortunately, by starting with a 
good enough frame {(p„}^ =l we can find an alternative set of estimates that, according to numerical experimentation, 
outperform (|9]i in cases where K and e are not too small. 

To be precise, a sequence of vectors {<p n } N _ x is said to be unit norm if ||^„|| = 1 for all n and is a tight frame if 
OO* = AI for some A > 0. Moreover, if both properties hold true, then {y?«}^ =1 is called a unit norm tight frame 
(UNTF) and the tight frame constant A is necessarily the redundancy ^ since MA = Tr(AI) = Tr((D<I>*) = Tr(0*0) 
V In particular, if {<p„}? l=l is a UNTF for WL M then j K*, <Pn)\ 2 = $ IWI 2 for all x and so (17) gives B K < f . Armed 
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with this fact, we return to the end of the proof of TheoremQ] again combining ( TT5T > and ( [ToT l. This time however, we 
forgo (Tl4T > in favor of the estimate /3k < fr, yielding a^e < (1 - s 2 )a-K + fi2 /^f ^ (1 _ e 2 )^ + s 2 jj f° r a ll *K and thus 



■rz^{a K ^ - s 2 jj) < min a % = a K . 



We summarize these facts in the following result. 

Theorem 2. Let M < K < N, let {<p n }^ =l be a unit norm tight frame for R M , and let {i!/ p } p =l be any e-net ©/or § M_1 . 
Then, the optimal upper and lower NERF bounds © for {<p n }^=i ore bounded by aK, s ® according to: 

To verify the intuition that led up to Theorem|2] note that it is indeed stronger than Theorem[JJ whenever 
% ^ jh-PK, e = tttt max max V \{i// p ,tp n )\ 2 = t^t max max(^ - V \{if/ p ,<p„)\ 2 ). 

ne<K n<f.K 

Simplifying, we see that this is equivalent to having 



(1 - e 2 )$ < f - min min V |<^„ y n )\ 



That is, Theorem |2] outperforms Theorem [TJ whenever there exists a p and 7C such that Yin<tx K&p* fn)\ 2 ^ 
Numerical experimentation reveals that this is often the case for the explicit NERF constructions we introduce in the 
following sections. Indeed, in those examples s is not too small and K is a significant fraction of N, meaning it is 
plausible for there to exist at least one if/ p which is nearly orthogonal to a small number of frame elements {(p n }„$jc. 
Before moving on to those constructions, we conclude this section with a brief discussion of how the above theory 
does not directly generalize to the deterministic RIP matrix construction problem. 

To be clear, e-nets are already popular subjects in compressed sensing, being fundamental tools of random matrix 
theory lfl6ll . However, in that literature, e-nets are exploited analytically rather than computationally, and are seldom 
constructed explicitly. In short, fix K < M < N and consider a tall M x K submatrix O^- of a randomly generated 
MxN sensing matrix <I>. If the rows of are chosen randomly, the distribution of ||<l>7cz|| 2 along any given direction 
Z e M. K is highly concentrated around its mean. By applying a union bound over an e-net for that represents the 
many different choices for z, one finds that with high probability, the values of ||<t>7<-z|| 2 are a nearly constant function 
of z. This, in turn, implies that the eigenvalues of O^-cD^ are nearly constant, as needed for RIR 

With regards to deterministic constructions of RIP matrices, the problem with this approach is that it essentially 
makes use of a distinct e-net for the column space of for each choice of 7C. To be precise, in the NERF construction 
problem, we have M < K and use a single e-net for M. M to simultaneously estimate the singular values of each short 
M X K submatrix of <I>. This stems from the fact that the optimal NERF bounds can be simplified by changing 
the order of optimization (O and sorting the inner products {(x, <p n )} N = i according to magnitude (0, yielding readily- 
computable quantities (0 to be evaluated over this e-net. That is, in the NERF problem, the choice of e-net is 
independent of 7C. The same argument falls apart when attempting to estimate restricted isometry constants since, 
as in the random approach, the e-net must lie in R. K , meaning it depends on one's choice of 7C. Formulaically, this 
is seen by writing the optimal lower restricted isometry constant ([TJ in terms of two successive minimizations which 
cannot be easily interchanged: 

2 



min min 

m=K y eR N 
supp(y)e7f 



2 y(n)<p n 

71=1 



Of course, this is not to say that the ideas presented here are not applicable to the RIP problem. Indeed, since the 
largest eigenvalue of ^>* % ^> K equals that of O^O^., Theorem [TJ can be used to estimate the upper restricted isometry 
constant of a proposed RIP matrix. However, the smallest eigenvalues of these matrices are essentially unrelated, 
meaning that a computationally tractable method for estimating lower restricted isometry constants remains elusive. 
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3. Group frame constructions of NERFs and s-nets 



In the previous section, we introduced a new method for numerically estimating optimal NERF bounds (0). This 
method involves evaluating the frame analysis operator <t>* at each point \f/ p of an e-net (O and sorting the resulting 
values [\(iff p , 'fin)\ 2 }^ = i in nondecreasing order. We then use these sorted values to compute the e-approximate NERF 
bounds (|9]l which estimate the optimal NERF bounds according to Theorems Q] and [2] In this section, we discuss how 
the number of elements in an e-net grows exponentially with M, making this approach computationally infeasible 
for general frames. We then show how we can sidestep this computational burden provided the NERF and e-net are 
constructed using the action of a large finite group of orthogonal matrices. 

Random matrix theorists have found elegant arguments for guaranteeing the existence of efficient e-nets lfl6ll . For 
example, given e > 0, we can iteratively construct {il/ p } p p ^ y as follows: take any \p\ e § M ~', and given {if/ p } k p=l , choose 
any i/^+i e § M_1 whose Euclidean distance from each previous tfr p is greater than e. Note that for any k, the closed 
balls around {if/ p } k p=l of radius | are disjoint from each other and, by the triangle inequality, lie inside the ball of radius 
1 + | centered at the origin. As such, the total volume of these tiny balls is less than the volume of the large one, 
meaning < (1 + |) M . In particular, we see that this process must terminate at some k — P, where P < (1 + |) M . 

Moreover, note this process only terminates when the e-balls around {i// p } p=l cover § M ~'. This means {i// p } p=l is an 
e-net for § M ~', both in terms of Euclidean distance as well as chordal distance ©, since for any x e § M ~' there exists 
p such that 

1 - |<jc, ^ p )\ 2 = (l + {x, - (x, < 2(1 - (x, = ||* - < e 2 . (18) 

In summary, this means that for any given e > 0, there exists an e-net (O for § M_I of at most (1 + -) M elements. 

Despite the elegance of this argument, this result is disheartening from a computational perspective, since (1 + |) M 
is enormous for even modest choices of e and M. Moreover, though it is possible to slightly improve the above 
estimates, one can also show — borrowing techniques from Section 3 of iflQIl . for example — that the number of points 
required for any e-net for § M ~ 1 grows exponentially with M. In particular, for an arbitrary frame \<p n , , it is infeasible 
to numerically compute <b*4i p for every choice of p. Indeed, it is this fact which motivates the main idea of this section: 
we construct frames {<fin}^ =l and e-nets {if/ p } p=l with such high degrees of symmetry that it suffices to only compute 
over a small subset of the if/ p 's. In particular, we found that such highly symmetric frames and e-nets can be 



constructed using the theory of group frames 11811 . As a side effect, the frames generated by this approach happen to 
be UNTFs, allowing us to use the NERF bound estimates of Theorem|2]in addition to those of TheoremQ] 

To be precise, let 1A — {U q }® =l be a finite subgroup of the group O(M) of all real M x M orthogonal matrices. We 
say that {tf n }^ =l is H -invariant if 

V<7 = \,...,Q, 3 a permutation cr of {1, . . .,N] s.t. U q ip„ = ±^ (r( „), Vn = 1, .. .,N. (19) 

That is, a frame is ^/-invariant if each element of the group 1A simply rearranges the frames elements; here, as is 
common in the frames literature, we make no distinction between a frame element and its negative, as each yields the 
same outer product, therefore leading to identical frame operators. As we now discuss, ^/-invariant frames permit us 
to compute their e-approximate NERF bounds ax, E and /3k, s © with surprising efficiency, provided the e-net possesses 
like symmetry. 

Indeed, consider an e-net obtained by orbiting a finite set of generators under the action of 1A. That is, given 
e > 0, choose {tAr)f =1 £ § M ~' such that {U q i// r }® =l * =I is an e-net ([8]). Here, note that computing a^ e and /3k, s 
seemingly involves QR operator-vector multiplications of the form $>*(U q \l/ r ): for each q and r, we first compute 
<S>*{U q (f r ) = {(U q i// r ,if„)}^ =l and then sum the K smallest and largest values {\(U q i[f r , </> n )| 2 }^ =1 ; taking the minimum 
and maximum of these lower and upper sums over all q and r yields a^ E and fix.e- However, only R of these operator- 
vector multiplications are truly needed: since {(p n }^ =l is ^/-invariant, then for any q and r, rewriting ( fT9l ) as U*(p„ = 
±ip -H„ ) reveals the values {\(U q ft r , <p„)\ 2 }^ =l = {\(<j/ r , U q ip„)\ 2 }f l=l = {K^,-, ^-i(„)>l 2 }^ = i to be a rearrangement of the 
values {\(if/ r , <Pn)\ 2 }„ = \- As such, the sum of the K smallest and largest values of {\(U q tfr r , y?n)| 2 )„ =I equals the sum of 
the K smallest and largest values of {\(i[t r , <p n )\ 2 }„=ii respectively. Thus, we truly only need to evaluate <f>*if/ r for each 
r, a 2-fold speedup over the direct method. We summarize these ideas in the following result. 

Theorem 3. Let U be a finite group of orthogonal matrices over M. M , and let {(p„}^ =l be U-invariant (119b . Then, 
choosing any {ijj r ) R r=l C § M ~' such that {U q i// r }® =l * =1 is an e-net ([§}, the corresponding s-approximate NERF 'bounds (O 
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can be computed as follows: 



IV IV 

a K , E = min V |(^ r , ip lT („))\ 2 , Pk,s = max V \{xj/ r , ipo-( n) )\ 2 

r=l,...,W^ — ' r=l,...,R — i 



where the r-dependent permutation o~ is chosen so that {\(if/ r , <Pain))\ 2 } n= i is arranged in nondecreasing order. 

In light of this result, we now focus on the problem of constructing explicit ^/-invariant frames and 1/-generated 
e-nets. In particular, recall from our previous discussion that the number of elements in our e-net {U q i{t r }® =l f =1 is 
necessarily very large. At the same time, we want to take R as small as possible in order for the approach of Theorem[3] 
to be computationally tractable. Together, these facts suggest we should use a group K that is very large. 

Taking a large K however poses a challenge from the perspective of constructing a useful ^/-invariant frame 
(<p n }^_ v Indeed, the most obvious construction of such a frame is as the orbit {U q ip} =] of some <p e § M ~' under the 
action of 11. In fact, it is known that such frames are necessarily UNTFs — meaning Theorem[2]applies — provided 
H is irreducible, that is, provided the vectors {U q x}® =l span K M for any nonzero x e M M . Unfortunately, when H is 
very large, such frames are also extremely redundant, making them unattractive from the standpoint of applications. 
However, as we see below, this issue can often be addressed by picking tp so that {U q <p}® =l is actually multiple copies of 
the same frame {<p n } N =v meaning the actual number of frame elements N will only be a small fraction of Q. Moreover, 
note that such a frame {tp„}^_, is also a UNTF whenever H is irreducible since its frame operator is ^ times the frame 
operatorof the UNTF {U q <p}® =v that is, <M>* = = ^1. 

These facts in hand, we are ready to find the explicit groups, frames and e-nets needed in Theorem[3] We begin 
by finding large finite irreducible groups of orthogonal matrices. In this paper we, for the sake of intuitive simplicity, 
focus on the groups of symmetries of the Platonic solids. To be clear, there are three types of such solids that exist 
in R M for every M, namely generalizations of the tetrahedron, octahedron and cube. The first is the simplex, whose 
symmetry group contains (M +1)! orthogonal matrices, each corresponding to a unique permutation of the simplex's 
M + 1 vertices. An explicit representation of this group is given in 110], where it is also shown that this group is 
irreducible. The second type of Platonic solid in R M is the cross-polytope whose 2M vertices are formed by taking 
the standard basis along with their negatives. The third type of Platonic solid is the dual of the cross-polytope, namely 
the hypercube whose 2 M vertices are formed by taking all M-long ±1 -valued sequences. 

Being duals, the cross-polytope and hypercube have the same symmetry group, namely the set of all 2 M M\ signed 
permutation matrices obtained by multiplying each of the M\ possible permutation matrices by each of the 2 M pos- 
sible ±1 -diagonal matrices. Note this group is irreducible since the set of all signed permutations of any nonzero x 
necessarily span R M : for any nonzero x' e R M , picking q and q' so that U q x and U q >x' are both nonnegative and 
nondecreasing, we have that (U q x)(M), (U q >x')(M) > and so 

< (U q x)(M) (U q ,x')(M) < (U q x, U q ,x') = ((U q }U q )x,x'). 

For the remainder of this paper, we focus exclusively on the case where K is the group of all signed permutations 
as opposed to the group of symmetries of the simplex. We do this for two reasons: signed permutations are easy to 
understand, and we seek the largest possible group with which to generate our e-net. That said, all of the main ideas 
below carry over to the simplex case. 

To reiterate, we generate our e-net by taking all signed permutations of a given set of generators {if/, }f =1 . The 
number 2 M Ml of such permutations is enormous. Indeed, for any fixed e > 0, this number eventually grows faster 
than the number (1 + |) M of points in the e-net we discussed earlier. Following the argument that led to Theorem[3] 
this means the sums of the K smallest and largest values of {\{i[f r , <fi n )\ 2 }^ = i equal those of {\(U q i// r ,ip n )\ 2 }^ =l for any 
q — 1, . . . , Q, meaning we can compute our e-approximate NERF bounds 2 M Ml times faster than before. 

Of course, this requires our frame {<p„}^ =l to be 1/-invariant, meaning that every signed permutation of any frame 
element yields another frame element, modulo negation. If we are not careful, this leads to impractically redundant 
frames. Indeed, if any frame element tp„ has entries with distinct absolute values, the total number of frame elements is 
at least 2 M ~'M!, leading to a redundancy ^ of at least 2 M ~ l (M - 1)!. The only way to avoid this is to generate {y?n}^ =1 
by taking all signed permutations of some ip whose entries are mostly zero, and whose nonzero entries assume only a 
few distinct values. For example, when M — 4, though there are 2 M Ml = 384 signed permutations of <p — [1 1 0]*, 
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only 12 of these lead to frame elements that are distinct modulo negation, namely 



1 

V5 



1 l l 

1 -1 o 

1 





1 



-1 





1 




-1 





1 
1 







1 



1 





1 



-1 



(20) 



To be precise, though there are 4 ! = 24 distinct 4x4 permutation matrices, there are only \ Z) = 6 distinct permutations 
of <p = [1 10 0]*. Moreover, though there are 2 4 = 1 6 distinct 4x4 diagonal matrices with diagonal entries + 1 , for any 
given fixed permutation of <p — [1 1 0]*, only 2 of these lead to frame elements which are distinct modulo negation. 
Indeed, for a general M, picking <p in this manner leads to a frame of N = 2^) = M(M - 1) elements; though the 
redundancy M - 1 of such a frame is still high for some applications, it is nevertheless much more reasonable than the 
redundancy of 2 M ~'(M - 1)! obtained for a general <p e S M_1 . 

To summarize, for any M we have fixed 1A to be the group of all M x M signed permutation matrices and have 
discussed how to use that group to construct examples of ^/-invariant UNTFs {ip„}^ =l . We now want to use the theory 
of this section and the previous one to explicitly estimate the optimal NERF bounds of {<p„}^ =l . In light of Theorem[3] 
all that remains to be done is to choose an e > and construct {i]/ r }^_ x Q § M ~' such that [U q if/ r } _j * =I is an e-net; the 
problem of constructing {(A r )f = i £ § M is the subject of the following section. 



4. Constructing s-nets for nondecreasing nonnegative vectors 

Letting H be the group of all M x M signed permutation matrices, note that combining the results of TheoremsQ] 
[2] and [3] gives a relatively fast method for estimating the optimal NERF bounds © of a ^/-invariant UNTF {<p„}^ =l , 
provided we are able to construct a relatively small number of vectors {if/ r }f =l £ 8 M ~' such that {U q if/ r } _j * =I is an 
e-net © for § M ~'. In this section, we discuss how this last problem is equivalent to constructing a small number of 
vectors {iA r )f =I which form an e-net for the small portion of the unit sphere that consists of all vectors whose entries 
are nonnegative and are arranged in nondecreasing order. We further discuss one explicit method for constructing 
such a collection [iff r } R = i- 

To be precise, we denote the set of all nonnegative, nondecreasing unit-norm vectors as 

S^ 1 := {x e § M - 1 : < jc(1) < • ■ ■ < x(M)}. (21) 
Note that the orbit \U q ^/) Q _, of any given ip e § M ~' under the action of K is invariant under all signed permutations 

Op 

of As such, when considering e-nets of the form {U q i// r }^ =l * =1 we may, without loss of generality, assume that 
ifr r e S^ 1 for all r. Moreover, for such {0>}* =1 we now show that {U q i// r }® =l f =1 being an e-net for § M ~' © is 
equivalent to {0>}f =I being an e-net for S^ -1 , namely to having 

mU £ Kn 1 s.t. Vx e §f n -', 3r s.t. (x,^ r ) > Vl-e 2 . (22) 

Lemma 4. Let {if/ r }f =l £ §™ 1 EJ an d let U — {U q }® =l be the group all signed M x M permutation matrices. Then 
{<Ar}f =1 is an e-net for §^,~' (122b if and only if{U q i// r }^ =l R r=l is an e-net for § M_1 d8). 

Proof. The "only if direction is straightforward, since if {0>}f = i satisfies (l22l >. then for any x € S M_1 , we can take q 
such that U q x € S^ _1 , and pick i[r r such that \(x, U~ l tl/ r )\ 2 = \(U q x,t// r )\ 2 > 1 - e 2 . This means we can pick the "ifr p " 
in © to be U q l \p r . 

For the less obvious "if direction, note that if {if/ r }^ =l £ has the property that {U q ifr r } =1 * =1 satisfies ©, then 
for any x e S^ 1 £ § M_1 , there exist q and r such that \{x, U q if/ r )\ 2 > 1 - e 2 . Furthermore, by replacing the signed 
permutation with its negative if necessary, we may assume without loss of generality that (x, U q iff r ) > 0, meaning 
we actually have (x, U q ip r ) = \(x, U q ip r )\ > Vl-e 2 . We now claim that if both x and ifr lie in §^ _1 , then the signed 
permutation U € H that maximizes (x, Uif/) is the identity 14 = I. Note that proving this claim gives the result since 
it implies (x, i// r ) > (x, U q ip r ) > VI - e 2 , as needed for d22l . 

To prove the claim, note that every signed permutation U is of the form (Ui//)(m) = {-l) T(m) ifr(o-(m)), where cr is 
a permutation of { 1 , . . . , M] and r is a {0, 1 }-valued function defined over { 1 , . . . , M}. As such, our goal is to find a cr 
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and t that maximize (x, Uif/) — Yim=\ ^('w)(-l) T(m V( cr ( m ))- Of course, since the entries of x and i/r are nonnegative, 
such sums only grow larger by taking r(m) = for all m. As such, this problem boils down to finding cr such that 
Yim=\ x(m)iff(cr(m)) is maximized. To do this, note that for a maximizing ctq and any mi < mi, evaluating our sum 
both at (To as well as at its composition with the two-cycle <x that interchanges cr (mi) and cro(mi) gives 

M M 

< ^ x(m)if/(o-Q(m)) - 2^ x(m)ip({& o cr )(m)) 

m=l m=l 

= x(mi)^(cr (mi)) + x(m 2 )\p(o-o(m2)) - x(m 1 )i/r(cro(m 2 )) - x(m 2 )^(cr (mi)) 
= [*(mi) - x(m 2 )][iA(o-o(mi)) - tKa- (m 2 ))]. 

Since x is nondecreasing, this implies that either (i) x(m\) = x(mi) or (ii) x(m\) < x(mi) and iff(cro(mi)) < iJf(<To(m 2 y). 
That is, the values of i/f(<x (m)) increase whenever the values of x(m) strictly increase. Moreover, over intervals where 
x(m) has constant value, we can rearrange the values ^(o"o(m)) so that they increase there as well; modifying <tq in 
this way preserves the value of Em=i x(m)tff(cr(m)), meaning this new permutation cr\ is also a maximizer. Overall, 
we see that any ctq which maximizes £m=i x(m)i//(cr(m)) yields another maximizer <j\ that has the additional property 
that if/ o o - ! is nondecreasing. To conclude, note that although there may be several such o"i's, the fact that i/f is itself 
nondecreasing implies that if/ — if/ o o"i, meaning that the maximum value of (x, Uiff) = Dm=i *(«)(- l) T ' m V( cr ( OT )) is 
indeed (x, i/r) = £m=i x(m)t//(m), as claimed. □ 

Having Lemma|4] we see the true computational advantage offered by Theorem[3] whereas a direct computation 
of the e-approximate bounds (0 requires evaluating <t>* at every point of an e-net for the entire sphere, we instead 
only need to evaluate <t>* at every point of an e-net (1221 for the small portion of the sphere that consists of nonnegative 
nondecreasing vectors ( f2Tb . provided our frame is invariant under signed permutations. Indeed, note that since the 
signed permutations of S^ 1 form an essential partition of § M ~' , we have that the surface area of S^f" 1 is that of § M_1 
divided by 2 M Ml. It is therefore plausible for the number of elements R in an £-net for S,^ 1 to be much smaller than 
the bound (1 + - e ) M on the number of elements in the e-net for S M ~' that we discussed in the previous section. At the 
same time, it is unreasonable to expect R to vanish on the order of (1 + |) M /(2 M M!), since you cannot form an e-net 
for by simply taking the elements of an e-net for § M_1 that happen to lie inside of S^ 1 ; doing so ignores the 
fact that §^~' is increasingly thin as M grows, meaning that for large M, many of the elements that cover parts of 
§^~' actually lie outside of it. 

To get an upper bound on the number of elements R in a decent e-net for §^~\ we now mimic the volumetric 
argument that produced the (1 + |) M bound on the number of points in an e-net for § M ~' . Here, it is more convenient 
to work with oo-balls (cubes) rather than 2-balls, since the nonnegativity and monotonicity conditions that define S^ _1 
must hold for all indices m. 

To be precise, given e > 0, we iteratively choose if/ r e S^ _1 so that \\if/ r - (/vlU > M~'-e for all r' < r. We 
continue to do so until it is no longer possible, forming {i/v}^ that has the property that for every x € §^~', there 
exists r such that \\x - ^lU < M~~-s. Note that such a {^ r }f = i is an e-net for S^~' d22"l) since for any x e S^ 1 , ( [T8T l 
gives 1 - \{x, i^,)| 2 < ||x - i/r r || 2 < M\\x - (4vIIm - fi2 - All that remains is to bound R. To do this, note that the oo-balls 
around {(4v} fi _j of radius jM~^s are disjoint, meaning their total volume R(M~ie) m is less than the volume of any 
set that contains them. To find such a set, take any x e K M for which there exists r such that \\x - if/ r \\oo < \M~t-e. 
Since if/ r e S^" 1 , this means there exists a nonnegative nondecreasing function each of whose entries i/f r (m) is within 
units of x(m). In particular, defining a> e R M , a>(m) := (2m — Y)\M~is, we see that x + co is nonnegative and 
nondecreasing. Moreover, the maximum entry of x + o) is easily bounded from above: 

II* + a>\\oo < \\x - i/fr\\oo + IkMU + IMU < \M-±s + \\i/f r \\ 2 + (2M - l)±ifcH e < 1 + M^-s. 
Writing x = x + a>, these facts together imply that the oo-balls around {|Ar}f = i of radius jM~^s all lie inside the set 

-a) + (iel M : < x(l) <■■■ < x(M) < 1 + M^e), (23) 

whose volume equals the fraction of the volume [2(1 + M^e)] M of the cube {x e R M : \\x\\<x, < 1 + M^-s] that corre- 
sponds to nonnegative nondecreasing vectors x. As the signed permutations partition of this portion of the cube form 
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an essential partition of the entire cube, we see that the volume of (O is [2(1 + M? s)] M I (2 M Ml) = (1 + M?s) M /M\. 
To summarize, for the e-net {t/'/-}* =I for §^ _1 that we have constructed in this manner, we necessarily have that the 
total volume R(M~~-e) M of the tiny cubes is no more than the volume (1 + M^e) M /Ml of (1231 1, Solving for/? and then 
using Stirling's approximation Ml > y[2~jtM M+ zz~ M gives 



VM\ M e M / 1 ^ M 

,,,i-r < 1 + — — 

Ml\ s I ^J2^M\ ^fMs> 



R<—\M+^\ < 1 + — . (24) 



Note that this bound on the number of points R in an e-net for S^ -1 is an improvement over the number of points 
(1 + =) M . Indeed, the base of the exponent in (l24l can remain constant even if we allow s = -^=. That is, for M large, 
we can produce an excellent estimate for the NERF bounds of a signed-permutation-invariant frame using far less 
computation than is needed for a coarse estimate of the NERF bounds of a not-so-symmetric frame. 

On the other hand, this method for constructing {if/, j leaves much to be desired from the computational perspec- 
tive expressed in Theorem [3] Indeed, note that this method of iteratively "picking if/ r such that \\iff r - if/ r >\\co > M~~s 
for all r' < r" is not explicit, yielding no actual values for the entries of if/ r with which to compute a^.e and fix.e- 
Moreover, the bound (l24l grows exponentially with M; if R truly grows at this rate, then even if we did have {i/'/-}f =I 
explicitly, it would be computationally intractable to compute a^.s and /3k, s via Theorem [3] for all but the smallest 
values of M. 

As such, the remainder of this section is focused on the problem of constructing an explicit e-net {i/'Af^ for S^, -1 
which has the property that R grows subexponentially with M. We stress that at this point in the discussion, it is far 
from clear that such e-nets even exist; as far as we know, the only way to demonstrate their existence is to use the 
explicit construction method we now introduce. 

When constructing e-nets ( l22b for S^ _1 , we found it helpful to view the vectors x, iff r e §^~' as nonnegative, 
nondecreasing unit-norm functions over a discrete real axis {1, . . . , M). This perspective allows us to draw inspiration 
from real analysis, specifically the theory of integration. In short, an e-net (1221 can be viewed as a fixed set of 
nonnegative, nondecreasing, unit-norm functions {if/ r }^ =l which has the property that every other such function x looks 
a lot like one of them. In the theory of integration, the standard way to approximate a nonnegative, nondecreasing 
function x is to use a step function. 

To be precise, we will first construct an explicit set of nonnegative, nondecreasing step functions {iA;-)f =1 and then 
normalize them if/ r := $v/||0>|| to form our e-net {^ r }* j £ §mi 1 ■ in order to retain control of the number R of these 
step functions, we will only allow the functions \jr T to attain one of L distinct possible positive values {b/}^ . Here, 
we fix bo = 1 and assume the b/'s are sorted in decreasing order. That is, in a manner similar to Lebesgue integration, 
we take horizontal slices of any given nonnegative, nondecreasing, unit-norm x, decomposing {1, . .., M] into the 
preimages x~ l (bi + \,bi\ = {m : bi+\ < x(m) < bf\. For any such x, we then compute an approximating step function \jr x 
by "rounding up" the values of x, that is, by defining 

7 , x / bi, b M < x(m) < bi, 

^ (W):= K-1, < x(m) < 6w . (25) 

Note that since x is nonnegative and has unit norm, taking bo = 1 ensures that (f25t indeed defines $r z at every index 
m. The remaining quantization levels {fr/lfr/ are free for us to choose. Though we investigated spacing these levels 
uniformly, that is, letting b/ — ^ for all /, we found that exponential spacing led to better, more elegant results about 
the approximating properties of our step functions. As such, for any 5 e (0, 1), we let b\ = 5 1 for all / = 0, . . . ,L — 1, 
meaning d25l > becomes 

6', 6 l+l < x(m) < 6 1 , 

8 L -\ < x(m) < 6 L 

Using such exponentially spaced levels makes it straightforward to estimate the inner product between any given 
x € S^ -1 and its corresponding step function :- $r x /\\ijf x \\: 

u 



*x(m)-={ ei-'i n ^ ^ ^ x l-i (26) 



ll<A.vll 



^ x{m)\j/ x {m) 

m— 1 

(^l^(m)l 2 ) 2 



(x, ip x ) m= i 



m=l 



n 



To be clear, we want a lower bound on ( 1271 ). as needed for our e-net (l22l i. Moreover, a lower bound on the numerator 
in ( 1271 ) follows immediately from the fact that we "round up" in d2"6l i. implying $r x (m) > x(m) for all m, and thus 

M M 

£ x{m)fa{m) > J] \x(m)\ 2 = ||x|| 2 = 1. (28) 

m=l m= 1 

Meanwhile, an upper bound on the denominator in d27b can be obtained by exploiting the exponential spacing of 
our quantization levels. Indeed, for any index m such that x(m) > 5 L ~ l , we know there exists I = 0, . . . , L - 2 such 
that 6 l+l < x(m) < S 1 ; here, the fact that x(m) < 1 follows from x(m) < \\x\\ = 1. For such m, ( l26b then gives 
$r x (m) — 6 1 — 5~ l S l+l < 6~ x x(rri), In particular, the square of the denominator in (l27l i satisfies 

MM M 

£|<^(m)| 2 = £ |-A,(m)| 2 + J] |<A,(m)| 2 

m=l m=l m=l 

x(m)>6 L - 1 x(m)<S L - 1 

M M 

< £ <T 2 Wm)| 2 + £ ^) 

m= 1 m= 1 

x(m)>S L - 1 xlm)<8 L -' 

< 5~ 2 + MS 2(L - l) . (29) 
Putting (l28l and d29l together gives our lower bound on (l27b : 

<x, ^> > [^ 2 + M<y 2(L_1) ]-i (30) 

At this point we pick 5 e (0, 1) so that the bound ( f30b is as strong as possible, meaning we pick 5 to be the square root 
of the x e (0, 1) that minimizes - + Mx L ~ l . For L > 2, calculus reveals this minimizing x — 6 2 to be [M(L - 1)] _I , at 
which point (f30b becomes 

<x,^)>(^i)^[(L-l)M]-i. (31) 

As a sanity check, note that for any fixed M, the right hand side of (fJTJ approaches 1 from below as L grows large. 
This makes sense, since for a large number of quantization levels, we expect our step function iff x to be a very good 
approximation of x, meaning (x, tff x ) » (x, x) = 1. At the same time, for any fixed L, this same quantity vanishes as M 
grows large. This is due to the fact that using any fixed number of levels results in increasingly poor approximations 
of some high-dimensional x's. For an e-net d22l) . we seek a compromise between these tw o extre mes. In particular, 
for any given s > and M, (f3TT > suggests we take L such that (■^■) 2 [(L- l)M]~2i > Vl - e 2 . Rearranging this 
expression, we summarize these facts in the following result. 

Lemma 5. For any M and e, take 6 - [M(L - 1)]~^ and L > 2 such that 

{L-\){\-s 2 ) L <±{^) L . (32) 

Then for any x e the step function \fr x — iff x /\\tfr x \\, where ij/ x is defined in (126b , satisfies (x, if/ x ) > Vl - s 2 . 

To get a better idea of how large L has to be in order for d32l l to hold, note that (^-) L approaches - for large L. 
More precisely, making use of some easily checked facts, we find that for any L > 2, 

- log[(^) L ] = l + Lf^^y<l + Lfl£^*xay=l + ^<2, 

and so (^-) L > 4. In particular, in order for d32b to hold, it suffices to pick L > 2 so that 

(L-l)(l-e 2 ) L < ^. (33) 

In essence, this means that for any fixed e, we can take L to grow as a logarithm of M. This is significant since the 
size of L strongly affects the number of distinct step functions of the form i// x . 
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To elaborate, note that in light of Lemma [5] we would like to define our e-net {^rHLi f° r §™ 1 as the set of all 
distinct step functions of the form i// x for some x e S^ -1 . Unfortunately, it is difficult to describe this set of i// x 's 
explicitly. Instead, we settle for a slightly larger set that is easier to parametrize. To this end, note that for any non- 
negative, nondecreasing, unit-norm tf/ x , we have that the pre-normalized step function {fr x d25l l is also nondecreasing. 
As such, each ifr x lives in the finite set 



and so we let [iff r } R =1 be an enumeration of the points in §^ _1 (L, 5). To find the number of elements R in this e-net, 
note that each ip e §^ _1 (L, 6) arises from a unique nonincreasing function r\ : {\,...,M} — > {0, . . . , L - 1} so that 
{(i(m) = 6 rl< - m > for all m. That is, R is the number of all such functions rj. Since there are at most L choices of rj(m) for 
each m = 1, . . . , M, we clearly have the upper bound R < M L . To compute R exactly, note that each nonincreasing rj 
corresponds to a unique way in which M can be written as a sum of L nonnegative integers, each integer representing 
the number of times rj attains a given value I. This is one of the classical "stars and bars" problems of combinatorics: 
each choice of rj corresponds to a unique way of placing L - 1 "bars" in the spaces between M + L "stars," and then 
removing one star from each of the L resulting blocks. That is, we have 



For any fixed e > 0, recalling from ( 1331 that L can grow logarithmically with M, we therefore see that (l34l defines an 
explicit e-net for §^~' whose cardinality is a subexponential function of M. This is a dramatic improvement over our 
volumetric bound d24l) on the size of such e-nets. In the next section, we provide numerical experimentation to better 
indicate exactly how R grows as a function of M and e. We conclude this section by combining these final facts with 
the results of Theorems Q~|[2] and [3] as well as Lemmas |4] and [5] yielding our main result. 

Theorem 6. Let {ip n }^ =l be a unit norm tight frame for M M which is invariant d!9l > under the action of M x M signed 
permutation matrices. For any e > 0, let 6 = [M(L — l)] - ^, take L > 2 such that (L — 1)(1 — e 2 ) L < j^(^-) L , and 



where ax, e and /3x, e are found by the following process: for any r — 1 , . . . , R, let aK, E ,r and f$K,e,r be the sums of the K 
smallest and K largest values of{\(ifr r , <^n)| 2 )^l], respectively; let ax, e — min ctK,e,r and /3k, s — maxjS^r . 

5. Numerical examples of NERFs generated via symmetry groups 

In the previous section, we provided our main result (Theorem |6]i which encapsulates all the main ideas of the 
paper, providing a numerical scheme for estimating the optimal NERF bounds of certain highly symmetric frames. In 
this section, we offer numerical experimentation to better indicate how useful this result actually is. In short, we shall 
see that while the method of Theorem|6]is extremely fast compared to existing methods, it still requires a significant 
amount of computation for even modest choices of M and e. 

We first describe our approach to implement Theorem [6] Given a fixed K, then for each choice of r — l,.,.,R, 
we perform the following three steps: we first compute {\(iff r , <p n )\ 2 }„ = i using O(MN) operations; we then sort the 
resulting values in nondecreasing order using 0(N log N) operations; we finally sum the K smallest and largest values 
of [\(iff r , y?«)l 2 )^ = i to form ctK, E ,r and fSx.e.r, respectively. That is, for each r — 1, ...,R, we expect to perform 0((M + 
log A0A0 operations. Implementing these calculations as a "for" loop over all r = 1, ...,R, we keep track of a running 
minimum of a^ e ,r as well as a running maximum of PK,e,r , in the end using 0((M + log N)NR) operations to compute 
czk,£ and /3k, s , as needed for 1351 . 

Note the above approach presents an opportunity: though Theorem|6]as stated applies to some fixed K, the above 
process can be slightly modified so as to simultaneously estimate the optimal NERF bounds that arise for every choice 




(34) 





(35) 
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of K between M and N. To be clear, consider the point in the above process where, for any given r, we have arranged 
the values {\(iff r , <fi n )\ 2 }^ = i in nondecreasing order. Rather than fixing K and summing the K first and last values in 
this list, we can alternatively, at the cost of only an additional O(N) operations, compute cumulative sums of this list 
from both its beginning and end, thereby simultaneously computing the values a^ E .r and Pk.es for all choices of K. 
Keeping the running termwise minima and maxima of these cumulative sum functions then produces <xk,e and /3k, s 
which, when used in (|35T >. estimate the optimal NERF bounds of [(p n } N =l for every choice of K. In particular, this trick 
allows us to determine, at little additional cost, the point at which our lower bound on <xk,s becomes positive, namely 
the smallest value of K for which our approach guarantees that every K vectors in {ip„}^ =l span R M . 

On the topic of speed, we further note that in nearly all the numerical examples we investigated, R turned out to 
be much larger than both M and N. Since our approach involves 0((M + log N)NR) operations overall, this means 
that the size of R is the key factor in determining whether we can, in a reasonable amount of time, determine the 
desired estimates for a given frame {<p n }^ =1 and e > 0. Moreover, as stated earlier, for any fixed e > the size of the 
smallest L > 2 such that (L - 1)(1 - s 2 ) L < j^(^-) L grows logarithmically with M, and so the number of elements 
R = ( M £ L [ ') - M L in the e-net (1341 grows subexpontially with M. Though this is an improvement over the exponential 
growth in M that we expect from d24l . this number R can still be enormous for even modest choices of M and e > 0. 
For example, when applying Theorem[6]to the frame (l20l of N - 12 elements in M — 4 dimensions, the sizes of the 
smallest possible L > 2 and resulting value (^^-i^) are ' f° r var i° us values of e, given in the following table: 



£ 2 


L 




^improved 


2- 1 


6 


126 


45 


2- 2 


19 


7315 


1107 


2- 3 


47 


230300 


15916 


2- 4 


110 


6438740 


202628 


2- 5 


249 


164059875 


2366922 



The values in the fourth column correspond to the improved values of R obtained by working with an e-net which 
is a proper subset of the e-net §^ 1 {L, S) given in d34b . To be precise, recall that the motivation behind §^,~' (L, 6) is to 
take an easily-parametrized e-net which, in accordance with Lemma|5j contains the normalized versions ip x of all the 
{6 }f~} -valued step functions \jr x obtained by "rounding up" any nonnegative, nondecreasing unit norm x. Though the 
definition ( f34b of §^ _1 (L, 6) relies on the fact that such x's are nonnegative and nondecreasing, it takes no advantage 
of their unit length. Indeed, in order for a nondecreasing {S l }f~2 -valued step function {jj to arise as the rounded-up 
version \jr x of some x e note it necessarily satisfies: 

M M 

i = Yu ^ Z ^ (w)|2 = ^ ||2 - (37) 

m=i m=i 

Moreover, recalling the previously used fact that <Jf x (m) < S~ l x(m) for all indices m such that x(m) > 6 L ~ l , such a >}i 
must also satisfy: 

M M M M 

i = 2 |x(m)|2 - Tj |x(m)|2 - Z ^i^i 2 = 52 X ^ (m)|2 - (38) 

in=\ m=\ m=l m=\ 

x(m)>6 L -^ x(m)>5 L - 1 tj/(m)>S L -' 

In particular, we can obtain an alternative version of Theorem|6]in which the e-net 1 (L, 6) is replaced with a proper 
subset of itself, namely 



S^CL, S)nU=-£- : m\ 2 > 1 > 6 2 J] |to| 2 . (39) 



m=l 

$j(m)>6 L - 



In the special case where M — 4 and N -2, the number of elements in ( [391 for various choices of e is given in the last 
column of (l36i >. We note that in practice, we compute the members of this smaller e-net by still forming each of the 
( M ^i _1 ) possible choices of i[/ e However, those i//'s that arise from 0-'s which do not satisfy the additional 
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conditions of d39l are "skipped" in the process of computing ciK,e and f3x, E - That is, for such iff, we do not compute, 
sort, and form cumulative sums of the values [\(ifr, <p n )\ 2 } n= i' For example, implementing this alternative version of 
Theorem [6] in Matlab and applying it to the 12-element UNTF (1201 for K 4 , we obtain the following values for the 
e-approximate lower NERF bound ax. e for various values of e: 



B 2 


K = 1 


K = 2 


K = 3 


K = 4 


K = 5 


K = 6 


K = l 


K= 8 


K = 9 


K = 10 


K= 11 


K = 12 


2-' 


0.0000 


0.0000 


0.0000 


0.0000 


0.0000 


0.0000 


0.3821 


0.7275 


1.0039 


1.5811 


2.1068 


3.0000 


2-2 


0.0000 


0.0000 


0.0000 


0.0000 


0.0000 


0.0000 


0.3824 


0.7193 


1.0003 


1.5213 


2.0325 


3.0000 


2 -3 


0.0000 


0.0000 


0.0000 


0.0000 


0.0000 


0.0000 


0.3821 


0.7192 


1.0000 


1.5085 


2.0117 


3.0000 


2-4 


0.0000 


0.0000 


0.0000 


0.0000 


0.0000 


0.0000 


0.3820 


0.7192 


1.0000 


1.5036 


2.0047 


3.0000 


2- 5 


0.0000 


0.0000 


0.0000 


0.0000 


0.0000 


0.0000 


0.3820 


0.7192 


1.0000 


1.5015 


2.0021 


3.0000 


a* 


0.0000 


0.0000 


0.0000 


0.0000 


0.0000 


0.0000 


0.3820 


0.7192 


1.0000 


1.5000 


2.0000 


3.0000 



(40) 



We then use each ax, B to find the lower estimate j^t [&k,e — s2 jj) on optimal lower NERF bound ax'- 



E 1 


K = 1 


K = 2 


K = i 


K = 4 


K = 5 


K = 6 


K = l 


K = 8 


K = 9 


K = 10 


K = 11 


K = 12 


2-' 


-3.0000 


-3.0000 


-3.0000 


-3.0000 


-3.0000 


-3.0000 


-2.2358 


-1.5451 


-0.9921 


0.1621 


1.2135 


3.0000 


2" 2 


-1.0000 


-1.0000 


-1.0000 


-1.0000 


-1.0000 


-1.0000 


-0.4901 


-0.0409 


0.3337 


1.0284 


1.7100 


3.0000 


2" 3 


-0.4286 


-0.4286 


-0.4286 


-0.4286 


-0.4286 


-0.4286 


0.0081 


0.3934 


0.7143 


1.2955 


1.8705 


3.0000 


2-4 


-0.2000 


-0.2000 


-0.2000 


-0.2000 


-0.2000 


-0.2000 


0.2075 


0.5672 


0.8667 


1.4038 


1.9383 


3.0000 


2 -5 


-0.0968 


-0.0968 


-0.0968 


-0.0968 


-0.0968 


-0.0968 


0.2975 


0.6457 


0.9355 


1.4532 


1.9699 


3.0000 


a K 


0.0000 


0.0000 


0.0000 


0.0000 


0.0000 


0.0000 


0.3820 


0.7192 


1.0000 


1.5000 


2.0000 


3.0000 



(41) 



For the purposes of comparison, the last rows of (l40b and ( |4"TT i give the actual numerical values of the optimal lower 
NERF bound ax for each K. Note that each ax lies below its upper estimate ( l40b and above its lower estimate (|4H . 
Moreover, as a function of e, the upper estimate ax, e appears to converge to ax much faster than the lower estimate. 
This makes sense, since ax, E is the solution to a discretization of the minimization problem (|4]i that defines ax- 
Regardless, note that even for the not-too-small value 2~ 3 of e 2 , we see from the coarse estimates in the corresponding 
row of fiTT i that any 7 of these 12 vectors form a frame for R 4 ; looking at the frame itself (l20l . we see this is the 
smallest K for which this is true since the last 6 columns of <t> clearly do not span R 4 . In short, sometimes even 
not-so-small values of e are good enough to produce a positive lower bound on ax- This is significant since otherwise 
we may have no information about ax whatsoever. 

To be clear, the "exact" values of ax given in the final rows of d4Qb and (HTt were obtained for any K by having 
Matlab find the minimum of the smallest eigenvalue of O^O^ over all ^-element subsets K Q {1, . . ,,N}\ this 
approach is only feasible in this particular example since N - 12 is so small, meaning is not too large. Indeed, 
for such small values of /V, this approach is faster than our e-net-based technique since the number of points in our 
e-net (136*1 1 becomes enormous when e gets small. For example, on a current generation laptop, we could compute the 
ax row of (1451 in under a second whereas the e 2 = 2 -3 and s 2 = 2~ 5 rows took several seconds and over an hour, 
respectively. When M and /V becomes large, this phenomenon disappears and our methods truly start to shine. 

For example, when M = 6, taking {ip n }^ =1 to be the /V = 2 3-1 Q = 80 signed permutations of ip = [1 1 1 0]* 
which are distinct modulo negation, our Matlab implementation of these ideas took around 8.84 seconds to find the 
following lower estimates on ax for K = 61, . . . , 80 using s — 4: 

61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 
0.62 1.22 1.78 2.30 2.89 3.38 3.95 4.46 4.92 5.33 6.06 6.46 7.20 7.93 8.64 9.33 10.25 11.14 12.03 13.33 

To do so, it found the smallest sufficient number of quantization levels to be L = 21, and evaluated the 80 X 6 
analysis operator of {f n }^ =x at each of the R = 32372 points of the y^j^j = 230230-element e-net d34l > that satisfy 
the additional requirements d37T > and (f38l . Note that in particular, these bounds indicate that any 61 of the 80 frame 
elements span M 6 . Obtaining this same fact directly involves forming each of the \^\ » 1.16 x 10 18 such submatrices 
and showing they have full rank. More importantly, these bounds indicate how well these subcollections span: using 
the upper bound (5 x < % — ^jf , we see that for any 61 of these 80 vectors, the condition number of their frame operator 
is at most Sji -L- a; 2 1 .50, a very reasonable number for stable reconstruction. 

For a more dramatic example, letting {f}^ =l be the N = 560 distinct-modulo-negation signed permutation of 
(p = [1 1 1 1000 0]* in R 8 and taking e = I, our Matlab algorithm took about three minutes to show that any 399 
of these vectors spanned R 8 . Moreover, it gave the e-approximate NERF bound 0404 s » 1.17 and so we know that 
the condition number of the frame operator of any 404 of these vectors is no more than ^fj\j < 60. At the same 

15 



time, it provided similar upper bounds on the condition number of O^tD^. for any 7C such that \"K\ — K > 399. To 
do so, the algorithm took L = 22 and evaluated the 560 x 8 analysis operator of {^ B }^ at the 503487 points of the 
4292 145 -element e-net d34l i that satisfy d37b and d38l ). While this is indeed a large amount of computation, it pales 
in comparison to simply forming the (^^j » 2.84 x 10 142 possible 8 x 404 submatrices of <t>, let alone estimating the 
condition number of each such matrix individually. Even more dramatically, taking the similarly constructed frame of 
N = 4032 elements in M 10 and letting s = i, our Matlab algorithm took about one hour and seventeen minutes to show 
that any K - 2883 of these elements form a frame for M 10 ; the sheer number of such subframes is mind-boggling: 
0« 3.65X101044. 

To our knowledge, the theory of this paper is the only currently known method for estimating the optimal NERF 
bounds for matrices of these sizes. To be clear, in [ 10] it is shown that if the entries of MxN matrix are independently 
chosen from a standard normal distribution then <t> has a high probability of being a good NERF provided K is taken 
to be no less than about 85% of N. However, in the example above, we got a meaningful result where K = 399 is less 
than 72% of N. To date, the best known explicit construction of a general family of NERFs is to use an equiangular 
tight frame (ETF) of M 2 - M + 1 vectors in C M ; such frames are robust against the removal of up to half of their 
frame elements 1 Oil - Though the examples we present here do not rival the erasure-robustness of these ETF NERFs, 
they do provide a lot of design freedom that such frames do not. In short, the methods used to prove that such ETFs 
are NERFs do not generalize in the slightest, whereas the methods presented here apply to a whole family of frame 
constructions. To be clear, when the ETF-inspired methods are applied in the group frames setting [10], one finds that 
K may be taken almost as small as N - jj. However, this fact has a more direct proof: For any UNTF {ip„}^ =l and any 
7Cc {1 V|. 

N 

$||x|| 2 = J] \{x, <p„)\ 2 = J] \(x, <p„)\ 2 + J] |<jc <p n )\ 2 < J] \(x, <p n )\ 2 + (N- K)\\x\\ 2 , Vx € M M , 

n=\ neK n{K neK 

where the final inequality follows from the Cauchy-Schwarz inequality. Thus, we see that 

[K - (N - $)]||x|| 2 < J] l<*> Vn)\ 2 < $IWI 2 , Vx e K M , 

meaning that for any K the corresponding optimal NERF bounds satisfy K - (N - y) < < fix < ~. In particular, 
it was already apparent that the previously discussed group frame of N - 560 vectors in is robust up to ^ = 70 
erasures, meaning we could already safely take K as small as 491. However, the results of this paper allow us to say 
much more: applying Theorem [6] in the case where e = j tells us K can actually be as small as 399; applying this 
same result with a smaller s may very well yield even smaller acceptable values of K. 

Conclusions and future work 

For many decades, numerical linear algebraists have intensely studied the problem of estimating the extreme 
eigenvalues of a self-adjoint positive semidefinite matrix. As a result, we have many excellent algorithms — power 
methods, QR iterations, etc. — for quickly and accurately computing these eigenvalues for even large matrices. How- 
ever, these algorithms cannot be directly applied to many emerging compressed sensing and frame theory problems, 
since these problems involve estimating the singular values of all submatrices of a given size, leading to a combina- 
torial nightmare. What we need are new, computationally tractable algorithms that allow us to estimate the singular 
values of all of these submatrices simultaneously. 

We believe the techniques of this paper are a step in that direction. However, as seen in the last section, these 
techniques are still computationally expensive. Much of this is the fault of our e-net for §^~' which, despite having 
far fewer elements than an e-net for the entire sphere S M_1 , still contains an enormous number of points for even 
small choices of M. As such, with regards to taking the results of this paper forward, the most critical problem that 
needs to be addressed is that of designing more efficient e-nets for . Indeed, this seems to be a fundamental 
problem in analysis: how many nonnegative, nonincreasing unit norm functions do I need to well-approximate every 
such function, where the quality of the approximation is measured in terms of chordal distance? Of course, many 
other important questions arise. Can the optimal NERF bound estimates of Theorems Q] and [2] be improved? Is there 
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a better method of performing the quantization that led to Lemma|5]? Here, we have already tried "rounding down" 
as an alternative, but found it to be slightly inferior to the "rounding up" approach we presented here. Moreover, how 
much of the calculation of Theorem|6]can be performed analytically, lessening our computational burden? 
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